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§ ■ Abstract 
(N 

^ . The first step in quantum information theory is the identification of entangle- 
^-5 ■ 

lO ■ ment as a valuable resource. The next step is learning how to exploit this re- 

'^ . source efficiently. We learn how to exploit entanglement efficiently by applying 

^Sj . analogues of thermodynamical concepts. These concepts include reversibility, 

o, 

r^ , entropy, and the distinction between intensive and extensive quantities. We 






discuss some of these analogues and show how they lead to a measure of en- 
tanglement for pure states. We also ask whether these analogues are more 



^ . than analogues, and note that, locally, entropy of entanglement is thermody- 

<^ : 

^ . namical entropy. 
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I. INTRODUCTION 

We are familiar with the distinction between "pure" and "apphed" research. In pure 
research, knowledge of the physical world is an end in itself, while in applied research, 
knowledge is only a means to an end. Usually, this distinction is a meaningful one; we 
can easily find examples of pure research that has no applications, and of applied research 
that does not increase our knowledge. But sometimes the distinction is not meaningful. 
For example, what about Carnot's invention of the cycle that bears his name — was it pure 
research or applied research? In this example, the distinction simply doesn't exist. It 
doesn't exist, and not only because Carnot invented his cycle while considering a practical 
engineering problem. It doesn't exist, because his "applied" research on the efficiency of 
heat engines was also essential "pure" research. Indeed, if Carnot had never considered a 
practical engineering problem, he would not have started thinking about limits to efficiency, 
and he would not have discovered the second law of thermodynamics. 

A second example, quite analogous to the first, arises in quantum information theory. 
Research on quantum entanglement led to quantum information theory only after physicists 
found uses for entanglement and thought about how to use entanglement efficiently. Limits 
on the efficient use of entanglement are fundamental to quantum information theory; hence 
applied research on entanglement is also essential pure research in quantum information the- 
ory. The striking analogy between the roles of efficiency in thermodynamics and in quantum 
information theory is the subject of the next section. This analogy leads us to several other 
analogies between the two theories. Sect. Ill shows how analogues of heat engines, entropy, 
the thermodynamic limit and the second law of thermodynamics appear in quantum infor- 
mation theory. The analogies between thermodynamics and quantum information theory 
are so striking that we ask, in the concluding section, if they are more than analogies. Is 
quantum information theory, after all, a branch of thermodynamics? 

II. ENTANGLEMENT AS A RESOURCE 

One of the most salient facts about quantum information theory is the fact that it came 
about recently, and not 60 or more years ago. On the one hand, it could not have come 
before the birth of quantum mechanics in 1926 and the identification of entanglement by 
Schrodinger |jl| and Einstein, Podolsky and Rosen (EPR) [Q in 1935. On the other hand, it 
could have, in principle, come soon after 1935. Why didn't it? What happened instead? 

In the decades following the EPR paper, most physicists ignored it. Bell was one of 
the few who did not; he agreed with EPR that "no reasonable theory" should allow such 
an unreasonable thing as quantum entanglement. Bell [Q published his famous inequalities 
almost 30 years after the EPR paper, and another five years passed until Clauser, Home, 
Shimony and Holt (CHSH) [^] suggested testing Bell's inequalities experimentally. The 
CHSH paper sparked intense interest in entanglement, and the many papers that followed 
it demonstrated again and again, in experiment and in theory, that quantum mechanics is 
unreasonable — but true. Thus entanglement became a wall for physicists to beat their heads 
against. How can the world be so unreasonable? Even physicists who were ready to stop 
beating their heads against the wall lacked a sense of direction. If we cannot understand 



entanglement, at least we could try, for example, to measure it, to quantify it. But various 
proposed measures of entanglement seemed equally plausible 0. 

What provided the direction was not the attempt to understand entanglement, but 
attempts to use it. Consider singlet pairs of spin-1/2 particles or photons shared by two 
remote observers, Alice and Bob; Alice has one particle in each pair, and Bob has the other. 
Alice and Bob can use these pairs in at least two ways. They can construct unbreakable 
codes, and they can teleport quantum states. Quantum cryptography began with a paper 
by Wiesner in 1983; in 1991, Ekert |^ applied entanglement to quantum cryptography. 
Quantum teleportation appeared in a paper by Bennett et al. |^ in 1993. Let us consider 
each of these applications in turn. 

The use of entanglement in quantum cryptography is straightforward. On each shared 
singlet pair, Alice and Bob measure polarization along identical axes. Since the pair is a 
singlet, Alice and Bob will always find the same (for photons) or opposite (for spin-1/2 
particles) polarizations. Either way, Alice and Bob can thus construct identical sequences 
of binary data that only they know. Suppose that Alice wants to send a coded message to 
Bob. First, she translates her message into binary. For example, let the binary message be 
1001011101110001010100010, which contains 25 bits. Next, she and Bob generate a shared 
random binary sequence of the same length. Then Alice adds the two sequences, in binary, 
and transmits the sum to Bob using any (public or private) channel. Finally, Bob subtracts 
the random binary sequence from the sum and recovers Alice's message. Only Bob can read 
Alice's message, because only Bob knows what to subtract. 

Teleportation, as everyone knows, is Captain Kirk's way of getting around. He would 
turn into a column of glimmering light and disappear, only to reappear far away. To explain 
teleportation, we approximate Captain Kirk as a single spin-1/2 particle in an unknown 
state \K): 
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{K stands for Kirk). Suppose Bob wants to transmit the state \K). Alice and Bob join to 
this state a singlet pair, such that the overall state is 
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where A and B stand for Alice and Bob, respectively. Next 
operator with four eigenstates |\E''^~^), |\E'^"'"''), l^*-"-*) and |$ 

I^^-^) = ^[ITa')Ub)-|U)ITb) 



Bob measures a nondegenerate 
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These states are called the Bell operator basis. Rewriting Eq. (Q) in this basis, we obtain 



- ^I*^^^)(a| U) - b\ U)) - l\<^^-^){a\ U) + b\ U)) (4) 

as the state of all three spins before Bob's measurement. Now Bob's measurement leaves 
the two spins at his end in one of the states of the Bell operator basis, and he knows which 
one from his measurement. Bob transmits this information — his two bits — to Alice; he can 
even broadcast the information without knowing where Alice is. When Alice receives the 
two bits, she infers the state of her spin. If the state is a\ ^a) — b\ I a), she rotates her spin 
vr around the z-axis. If it is a\ Ia) + b\ Ia), she does nothing. If it is a\ Ia) — b\ ^a), she 
rotates her spin vr around the y-axis. If it is a\ I a) +b\ ^a), she rotates her spin vr around the 
X-axis. In each case, the final state of her spin is a replica of Captain Kirk, a\ j^) + b\ Ia)- 
No trace of Captain Kirk remains with Bob. 

Hence every singlet pair that Alice and Bob share is a valuable resource, which Alice 
and Bob can use to encode one bit of a message or to teleport a single qubit. Also, they 
can use any pair that is related to a singlet pair by local unitary transformations. We call 
such a pair an ebit. For these applications of entanglement, only ebits will do; pairs in other 
entangled states would, in general, yield errors in transmission of a message or a qubit. But 
if Alice and Bob share pairs in other entangled states, they can, with a certain probability, 
extract ebits from these pairs. Here's how: Suppose Alice and Bob share the entangled state 

l^«) = «|Ta)ITb) + (i -«')'/'! UIU , (5) 

with a real and a > l/v2- Since a is too large, either Alice or Bob must reduce a by local 
filtering j^. So let Alice, say, run her spin through a selective filter that never absorbs the 
state I I a), but sometimes absorbs the state | j^). Let |0) denote the initial state of the 
filter and define local filtering according to a unitary operator U that sends 

|Ta)|0)--x|Ta)|0) + 2/|Ta)|1) , 

|U)|0)^|U)|0) . (6) 

Here |1) represents the state of the filter if it absorbs Alice's spin, and |xp + |?/p = 1. After 
Alice runs her spin through the filter, the combined state of the two spins and the filter is 

xa\ U)\ U) + (1 - c^Y'l U)\ Ib)] |0) + ya\ U)\ Tb)|1) • (7) 

Now Alice looks in the filter. The chance is jayp that she finds her spin there. If she 
does not find her spin in the filter, however, she knows that the state of the two spins is 
given by the bracketed term in Eq. (^, up to normalization. In particular, if we choose 
X = (1 — a'^Y^'^/a, the state of the two spins will now be an ebit, suitable for coding and 
teleportation. So Alice has a chance 1 — |a?/p = 2(1 — a^) of producing an exact ebit. Of 
course, she loses all the entanglement in {"^a) if the filter absorbs the | t^) state, but with 
a little luck, Alice and Bob can turn pairs in the state |\E'o) into a valuable resource. (It 
follows that pairs in the state |\E'q) are a valuable resource, too.) 

So we can extract singlets from other entangled pairs! Immediately, the question arises — 
just as it did for Carnot — whether local filtering is the most efficient method for extracting 



singlets. The answer is that, in general, it is not. If Alice and Bob share many pairs in the 
state I'^a), they can do better than locally filter the pairs, one by one. But to do better, 
Alice and Bob must apply collective operations to their entangled pairs — they must operate 
on many pairs together, and not one by one. Suppose Alice and Bob share two pairs in the 
entangled state |^I/o). The state of two pairs is 

a\ U)\ Tb) + (1 - aY'\ U)\ Ib)] [a\ U')\ U') + (1 - c^'Y^'l U')\ Ib')] , (8) 

where A and A' refer to Alice's spins, and B and B' refer to Bob's. Expanding Eq. (|^), we 
obtain 
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as the overall state. Now let Bob measure erf + erf , or Alice measure a^ + a^ . (The result 
is the same.) With probability 2a^(l — a^) the result is zero, and the state of the spins after 
the measurement is the bracketed term in Eq. (j^). Now the bracketed term is an ebit; we 
can define 

\U) = \U)\U') , \U) = \U)\U') 

\i^B)^-\U)\iB') , \ilB)^\iB)\U') (10) 

to write it explicitly as a singlet. 

For two pairs, this method is not more efficient than local filtering. But it gets more 
efficient as Alice and Bob apply it collectively to more pairs at a time ^. To apply the 
method to many pairs at a time, Alice and Bob first measure the 2;-component of total spin 
of the pairs. Whatever result they get, their pairs are left in a superposition of biorthogonal 
states with coefficients of equal magnitude. For example, let Alice and Bob have three 
distinct pairs in the state |^I/o). Suppose Alice measures the z-component of total spin to 
be 1/2; then the state of their pairs is 

I U)\ U)\ U')\ U')\ iA")\ Ib") + 1 UI Ib)\ U')\ U')\ U")\ U") 

+ \U)\U)\Ia')\Ib')\U")\U") , (11) 

up to normalization. Each time they measure the 2;-component of total spin on a set of 
pairs, they get a state of this form. The tensor product of such states is therefore also a 
state of this form. For example, two groups of three spins, each in the state of Eq. ([TlD , yield 
a tensor product state having nine terms with equal coefficents. Alice and Bob can build 
such states, with various numbers of terms in the superposition, until they get a number 
of terms equal to, or nearly equal to, a power of 2. A superposition with number of terms 
equal to 2" is unitarily equivalent to n ebits. Thus Alice and Bob can apply local unitary 
operations to transform their superposition into ebits. For example, Alice and Bob could, by 
locally filtering a state having nine terms, reduce nine terms to eight terms. Eight terms are 
unitarily equivalent to three ebits. Bennett, Bernstein, Popescu and Schumacher (BBPS) 
showed that Alice and Bob can obtain n singlets from k pairs of spins in the state |^l/o), 
where the ratio n/k approaches the limit 



lim l = E{\^^)) 

= -aHog^a^-{l-a')\og^{l-a') . (12) 

E^l^a)) is called the entropy of entanglement; it is the Shannon entropy of the squares of 
the coefficients of the Schmidt decomposition. It equals 1 if a = 1/v^ and equals for a 
product state. 



III. THERMODYNAMICAL ANALOGUES 

Applications of entanglement naturally raise the question of efficiency. We have two 
methods to extract singlets from generic entanglement; we know they are not equally ef- 
ficient. Are there more efficient methods? Is there a maximally efficient method? These 
questions are analogous to the questions that Carnot asked. We will now see also that the 
answers are analogous to his answers. 

First, the answer to the thermodynamical question involves a principle — the principle 
that there is no way to build a perpetuum mobile, i.e. to build a machine that works for 
free, without changing its environment. This principle is the second law of thermodynamics. 
For entanglement, there is an analogous principle: local operations cannot increase the 
entanglement between remote systems 11^,11] . Measurements, local unitary operations, and 



additional unentangled systems cannot increase the entanglement between Alice's and Bob's 
systems; neither can classical communication (i.e. communication that does not involve 
entanglement) between Alice and Bob. We can accept this principle as an axiom, or we can 



prove it as a theorem of quantum mechanics |TT|. It is analogous to the second law also 
in that it is a statistical law. The method of local filtering, for example, can increase the 
entanglement between the systems held by Alice and Bob, but on average it decreases the 
entanglement. 

Second, Carnot had the insight to focus on reversible transformations. Consider two 
reversible heat engines; suppose that both absorb heat Qi at Ti and expel heat Q2 at T2, 
but one does work W, and the other does work W' > W, per cycle. The first engine, if run 
in reverse, is a refrigerator — absorbs heat Q2 at T2 and expels heat Qi at Ti — and requires 
only work W per cycle. Thus the two engines together could provide W — W in work per 
cycle without changing their environment. Such a conclusion contradicts the second law of 
thermodynamics, so both engines must do the same work: W = W. 

Are the collective operations of BBPS reversible? Alice and Bob can turn shared singlets 
into shared pairs in the state I^E'q). Alice, say, can prepare pairs in the state {"^a) in her 
laboratory, and then teleport one spin out of each entangled pair to Bob. However, Alice 
then uses up one singlet pair for every spin that she teleports to Bob, so Alice and Bob use 
up k shared singlets to produce k shared pairs in the state |\E'a), whereas from k pairs in 
the state {"^a) they can recover only n < k singlet pairs. So this is not an efficient way to 
produce pairs in the state {"^a)- But Alice can teleport the pairs more efficiently using a 



method called quantum data compression |12|. The idea behind quantum data compression 
is as follows. Alice has to teleport k spins, i.e. a state in a 2'^-dimensional Hilbert space. 
But the effective dimension of the Hilbert space is much smaller than 2^, because the k 
spins have a common bias. In the state |\I'a) with a > 1, | |) is more likely than | |), so a 



sequence with every spin in the state | |) is much more hkely than a sequence with every 
spin in the state | |); still more likely are sequences with most, but not all, spins in the state 
II). In fact, the effective dimension of the Hilbert space approaches 2", rather than 2^^; that 
is, Alice can actually teleport the k spins to Bob without using more than the n singlets 
that she and Bob can obtain from k pairs in the state l^&a). Hence the extraction of singlets 
from pairs in the state |\I/q,) is reversible. It is not reversible for finite k and n, in general, 
but it is reversible as the number of systems approaches infinity, just as heat engines are 
reversible only in the thermodynamical limit. 

Now suppose that Alice and Bob share k pairs of systems in an entangled state l^l/a), 
which they transform into n singlets, using the method of BBPS. Did they use the most 
efficient method possible? That is, could Alice and Bob apply a more efficient method, using 
only local operations and classical communication, to obtain a greater number n' > n oi 
singlets from the same number k of initial pairs? The answer is that they cannot. For if it 
were possible to transform k of the initial pairs into n' singlets by a different method, Alice 
and Bob could then reverse the BBPS operations on n of the singlets and transform them 
into k pairs in the entangled state {"^a)- They could then obtain n' — n entangled pairs 
only using local operations and classical communication, contradicting the general principle 
that local operations cannot increase entanglement. Hence n' = n; the BBPS method is 
maximally efficient (in the limit k,n ^ cxd). 

As a byproduct, this proof gives us a measure of the entanglement of the state l^^a)- The 
k systems in the state |\1/q,) have the same entanglement as n singlet pairs. Thus the measure 
of entanglement for k pairs in the state |^q,) must equal the measure of entanglement for n 
singlets. At first, it might seem that we could assign an arbitrary measure of entanglement, 
such as n, ti? and e", to n singlets. But actually, the measure must be proportional to 
n, because the BBPS collective operations are reversible only when the number of systems 
becomes arbitrarily large. (The ratio n/k nearly always tends to an irrational number, and 
if the number is irrational, we can never reversibly extract n singlets from a finite number 
k of systems.) Reversibility requires us to go to the limit of infinite n, and for infinite n 
there is no way to define total entanglement. We can only define entanglement per system. 
Here too, we find a thermodynamical analogue: the thermodynamic limit requires us to 
define intensive quantities. Likewise, the measure of entanglement must be intensive, i.e. 
the measure of entanglement of n singlets must be proportional to n. It follows that the 
measure of entanglement for pure states is unique (up to a constant factor). Since the 
measure of entanglement of k systems \^a) approaches the measure of entanglement of n 
pairs in a singlet state, and since the measure is intensive, we have kE{\^a)) = n-, where E 
now denotes the measure, and the measure of entanglement of a singlet state is 1. Thus 

E{\^^))= hm I . (13) 

This limit indeed equals the entropy of entanglement of |^l/o), Eq. (|I^; so the measure of 
entanglement of |^a) niust equal its entropy of entanglement, up to a conventional propor- 
tionality constant — measuring the entanglement of a singlet pair or ebit — that we set it to 
1. 



IV. BEYOND ANALOGUES 

Thermo dynamical analogues are powerful tools for quantum information theory. How- 
ever, so far they remain mere analogues. Feynman [|1^] constructed an amusing analogue of 



the Carnot cycle to prove that gravitational potential energy near the surface of the earth is 
the product of weight and height. But his proof does not make gravity a branch of thermody- 
namics! So thermodynamical analogues in quantum information theory are not necessarily 
more than analogues. In particular, the previous section did not mention temperature, heat 
or heat baths in the context of entanglement. We did not need to ask how much work, if 
any, the BBPS method entails. So far, we have no reason to consider quantum information 
theory a branch of thermodynamics. I claim, however, that it is. 

In a Carnot cycle, the entropy of the heat engine changes twice per cycle. The heat engine 
absorbs entropy at the high temperature and releases entropy at the low temperature. At 
first glance, the flow of entropy in the Carnot cycle seems not to fit the thermodynamical 
analogues: entropy changes in the Carnot cycle, while BBPS collective operations conserve 
the entropy of entanglement. A closer look, however, reveals yet another analogue. The heat 
engine indeed absorbs and releases entropy, but so does the environment, such that the total 
entropy is unchanged. Otherwise, the heat engine would not be reversible. Likewise, the 
BBPS operations would not be reversible if they did not conserve entropy of entanglement. 
We can therefore guess that the systems shared by Alice and Bob are the analogue, not of 
the heat engine, but of the heat engine plus environment. 

How does this analogy work? Initially, say, Alice and Bob share k pairs of spins in an 
entangled state {"^a)- From these k pairs they extract, by the BBPS method, n ebits, with 
n < k. Aside from the n ebits, then, there remain k — n spins; by conservation of entropy 
of entanglement, these k — n spins contain zero entanglement. Thus, the extraction of ebits 
suggests a process in which an ensemble of spins at thermodynamical equilibrium divides 
into two ensembles, with differing average entropy per spin. If we now visit, say, Alice's 
laboratory, and forget about Bob, we cannot help but describe the process as heating and 
cooling of two subensembles: the k — n spins are in a pure state, hence they are "cold" , 
while the n ebits are in state of maximum entropy, hence they are "hot". The inverse of 
the BBPS method, in which n ebits are combined with k — n pairs of spins (shared by Alice 
and Bob) in product states, to yield k equally entangled pairs, is a process in which two 
ensembles at different temperatures are reversibly brought to the same temperature. 

We might conclude, then, that the BBPS method consumes work — precisely the amount 
of work required to separate the ensemble at thermodynamical equilibrium reversibly into 
"hot" and "cold" ensembles — and that the inverse of the BBPS method produces work — 
precisely the amount of work produced when the "hot" and "cold" ensembles come reversibly 
into thermodynamical equilibrium. However, this conclusion is premature. The reason is 
that our use of the word "reversibility" in the context of entanglement does not quite match 
its usage in thermodynamics. A local increase of entropy in Alice's laboratory, or in Bob's, 
may conserve the entanglement of the pairs they share, but it is not thermodynamically 
reversible. We must check whether the collective operations of the BBPS method have local 
thermodyamical effects that we have not taken into account fl^ . We have not done so here. 



because we have treated collective operations abstractly, without considering their physical 
implementation. But we can already conclude that entropy of entanglement is more than 



an analogue of thermodynamical entropy; locally, it is therniodynamical entropy. 

It may seem paradoxical that Alice's spins can have entropy and temperature (if we 
forget Bob) yet they belong to a pure state (if we remember Bob) . But such is the nature of 
entanglement. Consider a closed system comprising a measuring device, a measured system, 
and an environment, in an initial pure state. After the measurement, which entangles the 
measuring device and the measured system, decoherence sets in. The process of decoherence 
does not change the fact that the closed system is in a pure state; it merely makes the fact 
irrelevant, for all practical purposes, because no one can keep track of it. Furthermore, even 
though the system as a whole is in a pure state, its entangled subsystems are not. The 



objective — not subjective — entropy of the subsystems derives from entanglement [|T5[. In 
just this sense, Alice's and Bob's spins have objective entropy. 
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